Learning Features of Music from Scratch

نویسندگان

  • John Thickstun
  • Zaïd Harchaoui
  • Sham M. Kakade
چکیده

We introduce a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music performances under various studio and microphone conditions. We define a multi-label classification task to predict notes in musical recordings, along with an evaluation protocol. We benchmark several machine learning architectures for this task: i) learning from “hand-crafted” spectrogram features; ii) end-to-end learning with a neural net; iii) end-to-end learning with a convolutional neural net. We show that several end-to-end learning proposals outperform approaches based on learning from hand-crafted audio features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language, Music, and Brain

Introduction: Over the last centuries, scientists have been trying to figure out how the brain is learning the language. By 1980, the study of brain-language relationships was based on the study of human brain damage. But since 1980, neuroscience methods have greatly improved. There is controversy about where music, composition, or the perception of language and music are in the brain, or wheth...

متن کامل

شناسایی خودکار سبک موسیقی

Nowadays, automatic analysis of music signals has gained a considerable importance due to the growing amount of music data found on the Web. Music genre classification is one of the interesting research areas in music information retrieval systems. In this paper several techniques were implemented and evaluated for music genre classification including feature extraction, feature selection and m...

متن کامل

Scratch: creating and sharing interactive media

Scratch (http://scratch.mit.edu) is a new programming language that makes it easy to create interactive stories, music, games, art, and animations – and share those creations on the web. In a little over a year, more than 47,000 members have contributed over 300,000 projects online. Scratch presents powerful opportunities for learning – as people program and share with Scratch, they learn to th...

متن کامل

تعبیر عناصر موسیقی و معماری با زبان مشترک نمونه‌موردی موسیقی و معماری سنتی ایران

Every nation has its own ideals that objectifying them is by culture. In the process of this transformation, the architecture and music which come from the culture of each region have a fundamental role. Architecture and music have many similarities in conceptual directions, space, shape and morphology. They also have similarities in the basic and fundamental principles such as proportions, rhy...

متن کامل

Learning Sparse Feature Representations for Music Annotation and Retrieval

We present a data-processing pipeline based on sparse feature learning and describe its applications to music annotation and retrieval. Content-based music annotation and retrieval systems process audio starting with features. While commonly used features, such as MFCC, are handcrafted to extract characteristics of the audio in a succinct way, there is increasing interest in learning features a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1611.09827  شماره 

صفحات  -

تاریخ انتشار 2016